Corpus: deu_wikipedia_2021_300K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
Des Weiteren 366 402 343 1.25
Los Angeles 171 128 127 1.36
Buenos Aires 42 43 41 1.07
Tel Aviv 25 21 21 1.19
Sri Lanka 28 20 20 1.40
cabecera municipal 10 13 10 1.30
hervortretenden Sporenlager 14 10 10 1.40
Parzellar Katasters 5 6 5 1.20
Nuestra Señora 7 6 6 1.17
Addis Abeba 5 5 5 1.00
Burkina Faso 7 5 5 1.40
Palika Parishad 4 5 4 1.25
Nossa Senhora 6 5 5 1.20
Heutiger Titelinhaber 6 5 5 1.20
versicherungspflichtigen Beschäftigungsverhältnis 3 4 3 1.33
Swachh Bharat 3 4 3 1.33
České Budějovice 5 4 4 1.25
Shotgun Houses 3 4 3 1.33
Komischen Oper Berlin 3 4 3 1.33
Nagar Palika 5 4 4 1.25
969 msec needed at 2021-06-12 04:13